Perfect Matchings in 0(n log n) Time in Regular Bipartite Graphs 

Ashish Goel* Michael Kapralov^ Sanjeev Khanna* 

April 13, 2010 



Abstract 

In this paper we consider the well-studied problem of finding a perfect matching in a d-regular bipartite 
graph on 2n nodes with m = nd edges. The best-known algorithm for general bipartite graphs (due to 
Hopcroft and Karp) takes time 0{m^/n). In regular bipartite graphs, however, a matching is known to be 
computable in 0{m) time (due to Cole, Ost, and Schirra). In a recent line of work by Goel, Kapralov, and 
Khanna the 0{m) time bound was improved first to O (min{m, n^-^/c?}) and then to O (miiijm, n^/d}). 

In this paper, we give a randomized algorithm that finds a perfect matching in a d-regular graph and 
runs in 0{n log n) time (both in expectation and with high probability). The algorithm performs an ap- 
propriately truncated alternating random walk to successively find augmenting paths. Our algorithm may 
be viewed as using adaptive uniform sampling, and is thus able to bypass the limitations of (non-adaptive) 
uniform sampling established in earlier work. Our techniques also give an algorithm that successively 
finds a matching in the support of a doubly stochastic matrix in expected time 0(71 log^ n), with 0{m) 
pre-processing time; this gives a simple 0{m + mnlog^ 71) time algorithm for finding the Birkhoff-von 
Neumann decomposition of a doubly stochastic matrix. 

We show that randomization is crucial for obtaining o{nd) time algorithms by establishing an Q{nd) 
lower bound for deterministic algorithms. We also show that there does not exist a randomized algorithm 
that finds a matching in a regular bipartite multigraph and takes 0(71 log n) time with high probability. 
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1 Introduction 



A bipartite graph G = {P, Q, E) with vertex set P U Q and edge set C P x Q is said to be d-regular if 
every vertex has the same degree d. We use m = nd to denote the number of edges in G and n to represent the 
number of vertices in P (as a consequence of regularity, P and Q have the same size). Regular bipartite graphs 
have been studied extensively, in particular in the context of expander constructions, scheduling, routing in 
switch fabrics, and task-assignment llT3l [ni6l. 

A regular bipartite graph of degree d can be decomposed into exactly d perfect matchings, a fact that is an 
easy consequence of Hall's theorem 131 and is closely related to the Birkhoff-von Neumann decomposition of 
a doubly stochastic matrix ||2j[l5l. Finding a matching in a regular bipartite graph is a well-studied problem, 
starting with the algorithm of Konig in 1916 |[T2l . which is now known to run in time 0{mn). The well-known 
bipartite matching algorithm of Hopcroft and Karp iHTI can be used to obtain a running time of 0{m^/n). In 
graphs where d is a power of 2, the following elegant idea, due to Gabow and Kariv HI, leads to an algorithm 
with 0{m) running time. First, compute an Euler tour of the graph (in time 0(m)) and then follow this tour in 
an arbitrary direction. Exactly half the edges will go from left to right; these form a regular bipartite graph of 
degree d/2. The total running time T{m) thus follows the recurrence T{m) = 0{m) + T(m/2) which yields 
T{m) = 0{m). Extending this idea to the general case proved quite hard, and after a series of improvements 
(e.g. by Cole and Hopcroft f5T|, and then by Schrijver |[T4l to 0{md)), Cole, Ost, and Schirra |]6l gave an 
0(m) algorithm for the case of general d. Their main interest was in edge coloring of general bipartite 
graphs, where finding perfect matchings in regular bipartite graphs is an important subroutine. 

Recently, Goel, Kapralov, and Khanna lITOl . gave a sampling-based algorithm that computes a perfect 
matching in d-regular bipartite graphs in 0(min{m, - — j^}) expected time, an expression that is bounded 
by 0(?i^-^^). The algorithm of fTQ uses uniform sampling to reduce the number of edges in the input graph 
while preserving a perfect matching, and then runs the Hopcroft-Karp algorithm on the sampled graph. The 
authors of lITOl also gave a lower bound of Q. ^minjnd, on the running time of an algorithm that uses 
non-adaptive uniform sampling to reduce the number of edges in the graph as the first step. This lower bound 
was matched in [91, where the authors use a two stage samphng scheme and a specialized analysis of the 
runtime of the Hopcroft-Karp algorithm on the sampled graph to obtain a runtime of O ( minjnfi, ^} ) . 



For sub-linear (in m) running time algorithms, the exact data model is important. In this paper, as well 
as in the sub-linear running time algorithms mentioned above, we assume that the graph is presented in the 
adjacency aiTay format, i.e., for each vertex, its d neighbors are stored in an array. This is the most natural 
input data structure for our problem. Our algorithms will not make any ordering assumptions within an 
adjacency array. 

Given a partial matching in an undirected graph, an augmenting path is a path which starts and ends at an 
unmatched vertex, and alternately contains edges that are outside and inside the partial matching. Many of 
the algorithms mentioned above work by repeatedly finding augmenting paths. 

1.1 Our Results and Techniques 

Our main result is the following theorem. 

Theorem 1 There exists a randomized algorithm for finding a perfect matching in a d-regular bipartite graph 
G = {P, Q, E) given in adjacency array representation, and takes time O(nlogn) time both in expectation 
as well as with high probability. 

The algorithm is very simple: the matching is constructed by performing one augmentation at a time, 
and new augmenting paths are found by performing an alternating random walk with respect to the cun^ent 
matching. The alternating random walk on G, defined in Section |2j can be viewed as a random walk on 
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a modified graph that encodes the current matching. The random walk approach may still be viewed as 
repeatedly drawing a uniform sample from the adjacency array of some vertex v; however this vertex v is now 
chosen adaptively, thus allowing us to bypass the Q, ^mm{nd, lower bound on non-adaptive uniform 
sampling established in |[TOl . Somewhat surprisingly, we show that the total time taken by these random 
augmentations can be bounded by 0(n log n) in expectation, only slightly worse than the Q{n) time needed 
to simply output a perfect matching. The proof involves analyzing the hitting time of the sink node in the 
random walk. We also establish that randomization is crucial to obtaining an o{nd) time algorithm. 

Theorem 2 For any 1 < d < n/12, there exists a family of d-regular graphs on which any deterministic 
algorithm for finding a perfect matching requires Q{nd) time. 

We also give a lower bound on the running time of any randomized algorithm for finding a perfect match- 
ing in d-regular bipartite multigraphs, even with edge multiplicities bounded above by d/2: 

Theorem 3 Let A be any randomized algorithm that finds a matching in a d-regular bipartite multigraph 
with n nodes and edge m.ultiplicities bounded above by d/2. Then there exists a family of graphs for which A 
probes at least (7/64) n In n locations in the input adjacency arrays with probability at least n~'^. 

We note that the algorithm of Theorem [T] takes 0(n log n) time with high probability on multigraphs with 
edge multiplicities less than d/2 and hence, the lower and upper bounds are tight in this particular case. It is 
also interesting to contrast Theorem [3] with the result of |[T6l . which shows that sampling a constant number of 
edges incident to every vertex of a complete bipartite graph yields a subgraph that contains a perfect matching 
with high probability, i.e. the sampling complexity is 0{n) even if a high probability result is desired. The 
lower bound on the randomized algorithm is not as comprehensive as the deterministic lower bound: it holds 
only for very specific values of d (specifically, d = 0(n)), it bounds the "with high probability"-running 
time as opposed to the expected running time, and it works for multi-graphs. Obtaining tight upper and lower 
bounds for the entire range of parameters and for expected running time remains an interesting open problem. 

Our techniques also extend to the problem of finding a perfect matching in the support of a doubly- 
stochastic matrix, as well as to efficiently compute the Birkhoff-von-Neumann decomposition of a doubly 
stochastic matrix. 

Theorem 4 Given an n x n doubly-stochastic matrix M with m non-zero entries, one can find a perfect 
matching in the support of M in 0{n log^ n) expected time with 0{m) preprocessing time. 

In many applications of Birkhoff von Neumann decompositions (e.g. routing in network switches HI), 
we need to find one perfect matching in a single iteration, and then update the weights of the matched edges. 
In such applications, each iteration can be implemented in 0{n log^ n) time (after initial 0{m) preprocessing 
time), improving upon the previous best known bound of 0{mb) where b is the bit precision. 

Corollary 5 For any k >\, there exists an 0{m + kn log^ n) expected time algorithm for finding k distinct 
matchings (if they exist) in the Birkhoff-von-Neumann decomposition of cm n x n doubly stochastic matrix 
with m non-zero entries. 

Finally, we note that an application of Yao's min-max theorem (see, for instance, |[13|) to Theorem [J 
immediately yields the following corollary: 

Corollary 6 For any distribution on regular bipartite graphs with 2n nodes, there exists a deterministic 
algorithm that runs in average time 0(n log n) on graphs drawn from this distribution. 

A similar corollary also follows for doubly stochastic matrices. 
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1.2 Organization 

Section|2]gives the 0{n log n) time algorithm to find a perfect matching, and establishes Theorem[T] Building 
on the ideas developed in Section |2l we present in Section [3?T] algorithms for finding matchings in doubly- 
stochastic matrices, and computing a Birkhoff-von-Neumann decomposition, establishing Theorem |4] and 
Corollary [5] In Section IH we present an Q{nd) lower bound for any deterministic algorithm that finds a 
perfect matching in a d-regular graph. Finally, in Section |5] we present an il(nlogn) lower bound for any 
algorithm that finds a matching in a regular bipartite multigraph with high probability. 

2 Matchings in cZ-Regular Bipartite Graphs 
2.1 The Basic Algorithm 

Let G = {P, Q, E) denote the input d-regular graph and let Af be a partial matching in G. We first describe 
the alternating random walk on G with respect to M. We assume that the algorithm has access to the function 
SAMPLE-OUT-EDGE that takes a vertex u £ P and returns a uniformly random unmatched edge going out 
of u. The implementation and runtime of SAMPLE-OUT-EDGE depend on the representation of the graph. It 
is assumed in Theorem[T]and in this section that the graph G does not have parallel edges and is represented in 
adjacency array format, in which case SAMPLE-OUT-EDGE can be implemented to run in expected constant 
time. In Theorem IH however, a preprocessing step will be required to convert the matrix to an augmented 
binary search tree, in which case SAMPLE-OUT-EDGE can be implemented to run in 0(log n) time. 

The alternating random walk starts at a uniformly random unmatched vertex uq G P and proceeds as 
follows: 

1. Setu :=SAMPLE-OUT-EDGE(nj); 

2. If V is matched, set Uj+i := M{v), otherwise terminate. 

Note that an augmenting path with respect to M can be obtained from the sequence of steps taken by the 
alternating random walk by removing possible loops. 
We now state a basic version of our algorithm: 

Algorithm 1 

Input: A d-regular bipartite graph G = (P, Q, E) in adjacency array format. 
Output: A perfect matching of G. 

1. Set j := 0, Mo := 0. 

2. Run the alternating random walk starting from a random unmatched vertex in P until it hits an unmatched 

vertex in Q. 

3. Denote the augmenting path obtained by removing possible loops from the sequence of steps taken by the 

walk by p. Set A/j+i := MjAp. 

4. Set j := j + I and go to step 2. 

We prove in the next section that this algorithm takes 0(n log 7i) time in expectation. The high probability 
result is obtained in section 12.31 by performing appropriately truncated random walks in step 2 instead of a 
single untruncated walk. 
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2.2 Expected Running Time Analysis 



The core of our analysis is the following lemma, which bounds the time that it takes an alternating random 
walk in G with respect to a partial matching M that leaves 2k vertices unmatched to reach an unmatched 
vertex. 

Lemma 7 Let G = {P, Q,E) be a d-regular bipartite graph and let M be a partial matching that leaves 2k 
vertices unmatched. Then the expected number of steps before the alternating random walk in G reaches an 
unmatched vertex is at most 1 + n/k. 



It will be convenient to use the auxiliary notion of a matching graph H{G, M) which will allow us to 
view alternating random walks in G with respect to M as random walks in H{G, M) starting from a special 
source node s and hitting a special sink node t. We then get the result by bounding the hitting time from s to 
tmH{G,M). 

The matching graph con^esponding to the matching M is defined to be the directed graph H obtained by 
transforming G as follows: 

1. Orient edges of G from P to Q; 

2. Contract each pair (n, f ) G M into a supernode; 

3. Add a vertex s connected by d parallel edges to each unmatched node in P, directed out of s; 

4. Add a vertex t connected by d parallel edges to each unmatched node in Q, directed into t. 

We now state some properties of the graph H. The graph H has n + k + 2 nodes and n{d — 1) + k{2d + 1) 
edges. Note that for every vertex v ^ H, v ^ s,t the in-degree of v is equal to its out-degree. The out-degree 
of s equals dk, as is the in-degree of t. Finally, a random walk in H starting from s reaches t with probability 
1, and, most importantly, such a walk corresponds to an alternating random walk in G with respect to AI. 

We now prove that the expected time that it takes a random walk in H{G) starting from s to reach t is at 
most 1 + n/k. 

For a vertex i G V{H), we denote its out-degree by deg(i). Also, let P denote the transition matrix of the 
random walk, that is: 



Denote the vector of out-degrees by deg(-), the standard basis vector corresponding to a vertex j by Cj. 
Note that in fact the out-degree of the nodes of H can only assume 4 values: (1) the starting vertex s has 
deg(s) = dk, (2) all unmatched nodes u have deg(ti) = d, (3) all supernodes v have deg(f ) = d — 1, and (4) 
deg(t) = 0. 

We assume wlog that all vertices are reachable from s since those that are not do not influence the run- 
time of the algorithm. It then follows that t is reachable from any vertex of H in at most n + 3 steps, i.e. 
||(P^)"+^||oo < 1, so (I — P-^)^^ exists. The expected number of visits to a vertex j G V{H) during the 
random walk is given by 



Proof: 





(1) 



Hence, the expected number of steps before reaching t is 
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Note that for all j ^ twe have 
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k>0 



T T 

et = Cj et + Cj 



fc>0 



since efP'^et = Pu = for all i. 
Hence, we can rewrite ([T]) as 
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Cs — Cj 
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A;>0 




fc>0 
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However, we have that 



To verify equation ([4]), we calculate 



(l-P^)~\es-et) 



deg(-) 
dk ■ 



a _ pT^Mi - Mi _ pT^i - e - e. 



which follows from the following case analysis: 
Case (1): j / s, t Then 



E 



1 



deg(0 



deg(0 = deg(j) 



l(iV{H):(l,j)(iE{H) 

since the in-degree of every j G V{H),j ^ s, t, is equal to its out-degree. 
Case (2): j = s Then 

e^P^deg(-) = EP'^ deg(/) = 0. 



Case (3): j = t Then 



e?^P^deg(-) = EP'tdeg(0 
1 



I 



^ deg(/) 



deg(Z) = dk 



since the in-degree of t is dk. 
Hence, the contribution from vertex j is 



eJ[I-P-]-'fe-e,) = ^^, 

SO the expected number of steps before the random walk reaches t is at most 

^ deg(i) ^ (n - A;) + 2fc ^ ^ ^ n 



jeV{H),j^s,t 



dk 



We can now prove 



Theorem 8 Algorithm 1 finds a matching in a d-regular bipartite graph G = {P, Q, E) in expected time 
0{n log n). 

Proof: By Lemma|7]it takes at most \ + n/{n — j) expected time to find an augmenting path with respect to 
partial matching Mj . Hence, the expected runtime of the algorithm is bounded by 

n-l 

1 + n/{n — j) = n + nHn = 0{n log n), 

j=0 

where H{n) := 1 + 1/2 + 1/3 + . . . + 1/n is the n-th Harmonic number. ■ 
2.3 Truncated Random Walks and High Probability Analysis 

In this section we show how Algorithm 1 can be modified by introducing truncated random walks to obtain a 
running time of 0{n log n) whp. 

Algorithm 2 

Input: A d-regular bipartite graph G = {P, Q, E) in adjacency array format. 
Output: A perfect matching of G. 

1. Set j := 0, Mo := 0. 

2. Repeatedly run alternating random walks for 2 ^1 + steps until a successful run is obtained. 

3. Denote the augmenting path obtained by removing possible loops from the sequence of steps taken by the 

walk by p. Set Mj+i := MjAp. 

4. Set j := j + 1 and go to step 2. 

We now analyze the running time of our algorithm, and prove Theorem [T] 
Proof of Theorem [TJ 

We now show that Algorithm 2 takes time 0(n log n) whp. First note that by Lemma |7] and Markov's 
inequality, a truncated alternating random walk in step 2 succeeds with probability at least 1/2. Let Xj 
denote the time taken by the j-th augmentation. Let Yj be independent exponentially distributed with mean 

— i 

/^i ln2- 



qbj In 2 



Pr[Y, > qb,] 



Pr[Xj > qbj] < 2"'? = exp 
for all g > 1, so 

Pr[Xj >x]< Pr[Yj > x] (5) 

for all X > bj. We now prove that Y := J2o<j<n-i ^ — '^^ w.h.p. for a suitably large positive constant 
c. Denote p := E[y]. By Markov's inequality, for any t,6 > 

Pr[y > (1 + 6)fj] < ^ ^ 



at{l + 5)fl ■ 

Also, for any j, and for t < l/fij,we, have 



1 f °° 1 
flj Jo 1 - tHj 
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The two expressions above, along with the fact that the Yfs are independent, combine to give: 

-t{l+5)ij. 

Pr[y>(l + JM< (6) 

Observe that is the largest of the /ij's. Assume that t = 2jr~^' which implies that (1 — tfij) > e"*'^^ 
Plugging this into equation [6l we get: 

Pr[y > (1 + 6)fi] < e-(i+^-i'^4W(2M„-i)_ (7) 

Further observe that fi = 2n/ln2 + - 2/ln2)H{n) > ^n-iH{n), where H{n) := 1 + 1/2 + 1/3 + 

. . . + l/?i is the 72-th Harmonic number. Since H{n) > Inn, we get our high probabihty result: 

Pr[y > (1 + S)fi] < n-(i+^-i'^4)/2_ (^g) 
Since jj, = 0{n log n), this completes the proof of Theorem [1] ■ 



3 Matchings in Doubly-Stochastic Matrices and Regular Bipartite Multigraphs 

3.1 Doubly-Stochastic Matrices 

We now apply techniques of the previous section to the problem of finding a perfect matching in the support 
of an n X n doubly stochastic matrix M with m non-zero entries. A doubly-stochastic matrix can be viewed as 
a regular graph, possibly with parallel edges, and we can thus use the same algorithm and analysis as above, 
provided that SAMPLE-OUT-EDGE can be implemented efficiently. We start by describing a simple data 
structure for implementing SAMPLE-OUT-EDGE. For each vertex v, we store all the outgoing edges from 
w in a balanced binary search tree, augmented so that each node in the search tree also stores the weight of 
all the edges in its subtree. Since inserts into, deletes from, and random samples from this augmented tree all 
take time O(logn) Q, we obtain a running time of O(nlog^n) for finding a matching in the support of a 
doubly stochastic matrix. 

Superficially, it might seem that initializing the balanced binary search trees for each vertex takes total 
time 6(mlogn). However, note that there is no natural ordering on the outgoing edges from a vertex, and 
we can simply superimpose the initial balanced search tree for a vertex on the adjacency aiTay for that vertex, 
assuming that the underlying keys are in accordance with the (arbitrary) order in which the edges occur in the 
adjacency array. 

The complete Birkhoff-von Neumann decomposition can be computed by subtracting an appropriately 
weighted matching matrix from M every time a matching is found, thus decreasing the number of nonzero 
entries of M. Note that the augmented binary seaixh tree can be maintained in O(logn) time per deletion. 
This yields the algorithm claimed in Corollary |5] 

3.2 Regular Bipartite Multigraphs 

For regular bipartite multigraphs with edge multiplicities at most d/2, Algorithm 2 still takes time at most 
0{n log n) with high probability, since SAMPLE-OUT-EDGE can be implemented by sampling the adjacency 
list of the appropriate vertex in G until we find an unmatched edge. Each sample succeeds with probability 
at least 1/2 since the matched edge can have multiplicity at most d/2. Here, we assume that an edge with 
multiplicity k occurs k times in the adjacency arrays of its endpoints. 

We also note that our algorithm can be implemented to run in 0{n\ogn) time without any assumptions 
on multiplicities if the data layout is as follows. For each vertex we have an adjacency array with edges 
of multiplicity k appearing as contiguous blocks of length k. Also, each element in the adjacency array is 
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augmented with the index of the beginning of the block con^esponding to its edge and the index of the end 
of the block. Assuming this data layout, SAMPLE-OUT-EDGE can be implemented in 0(1) expected time 
regardless of edge multiplicities: it is sufficient to sample locations outside the block corresponding to the 
cunently matched edge. 

It remains to note that when the size of the support of the set of edges, which we denote by m^, is small, 
then the data representation used in finding a matching in a doubly-stochastic matrix can be used to find a 
matching in time 0(ms+n log^ n). It is interesting to compare this runtime to the result of [6]. The runtime of 
their matching algorithm is stated as 0(m + n log^ d) = 0{rn), but it is easy to see that it can be implemented 
to run in 0(171 s + n log^ d) time. 

Remark 9 Our algorithm can be used to obtain a simple algorithm for edge-coloring bipartite graphs with 
maximum degree d in time 0{m log n) (slightly worse than the best known 0(m log d) dependence obtained 
in In the first step one reduces the problem to that on a regular graph with 0(m) edges as described in 
The lists of neighbors of every vertex of the graph can then be arranged in a data structure that supports 
sampling and deletion in 0(1) amortized time. It remains to find matchings repeatedly, taking 0(n\ogn) 
time per matching. This takes 0(nd\ogn) = 0(m log n) time overall. 

4 An Q(nd) Lower Bound for Deterministic Algorithms 

In this section, we will prove Theorem |2] We will show that for any positive integer d, any deterministic 
algorithm to find a perfect matching in a d-regular bipartite graph requires il(nd) probes, even in the adjacency 
array representation, where the ordering of edges in an array is decided by an adversary. Specifically, for any 
positive integer d, we construct a family Q(d) of simple d-regular bipartite graphs with 0(d) vertices each 
that we refer to as canonical graphs. A canonical bipartite graph G(P U {t}, Q U {s}, E) G Q(d) is defined 
as follows. The vertex set P = PiVJ P2 and Q = Q1VJQ2 where |Pj| = \Qi\ = 3d for i € {1, 2}. The vertex 
s is connected to an arbitrary set of d distinct vertices in Pi while the vertex t is connected to an arbitrary set 
of d distinct vertices in Q2. In addition, G contains a matching M' of size d that connects a subset Q[ C Qi 
to a subset P2 ^ P2, where jQ'il = 1^2 1 = ^- The remaining edges in E connect vertices in Pi to Qi for 
i G {1, 2} so as to satisfy the property that the degree of each vertex in G is exactly d. It suffices to show an 
ri((i^) lower bound for graphs drawn from Q(d) since we can take Q(n/d) disjoint copies of canonical graphs 
to create a li-regular graph on n vertices. 

Overview: Let D be a deterministic algorithm for finding a perfect matching in graphs drawn from G(d). 
We will analyze a game between the algorithm V and an adaptive adversary A whose goal is to maximize 
the number of edges that V needs to examine in order to find a perfect matching. In order to find a perfect 
matching, the algorithm V must find an edge in M', since s must be matched to a vertex in Pi, and thus in 
turn, some vertex in Qi must be matched to a vertex in P2. We will show that the adversary A can always 
force V to examine Q,(d'^) edges in G before revealing an edge in M'. The specific graph G € Q(d) presented 
to the algorithm depends on the queries made by the algorithm V. The adversary adaptively answers these 
queries while maintaining at all times the invariant that the partially revealed graph is a subgraph of some 
graph G G Q(d). The cost of the algorithm is the number of edge locations probed by it before A reveals an 
edge in M' to V. 

In what follows, we assume that the adversary reveals s, t and the partition of remaining vertices into 
Pj, Qi for 1 < i < 2, along with all edges from s to Pi and all edges from t to Q2, to the deterministic 
algorithm V at the beginning. The algorithm pays no cost for this step. 

Queries: Whenever the algorithm V probes a new location in the adjacency aiTay of some vertex n G P U Q, 
we will equivalently view V as making a query Q(u) to the adversary A, in response to which the adversary 
outputs a vertex v that had not been yet revealed as being adjacent to u. 
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Subgraphs consistent with canonical graphs: Given a bipartite graph G'{PU{t}, Qu{s}, E'), we say that 
a vertex u ^ P U Q is free if its degree in G' is strictly smaller than d. We now identify sufficient conditions 
for a partially revealed graph to be a subgraph of some canonical graph in Q{d). 

Lemma 10 Let Gr{P U {t}, Q {s} , Ej) be any simple bipartite graph such that 

(a) the vertex s is connected to d distinct vertices in Pi and the vertex t is connected to d distinct vertices 
in Q2, 

(b) all other edges in Gr connect a vertex in Pi to a vertex in Qifor some i G {1,2}, 

(c) degree of each vertex in Gr is at most d, and 



(d) at least ^ vertices each in both Qi and P2 have degree strictly less than 



5- 



Then for any pair u, v of free vertices such that u £ Pi and v G Qifor some i £ {1, 2}, and (u, v) Er, 
there exists a canonical graph G{P U {t}, Q U {s}, E) G Q{d) such that that Er U (n, v) C E. 

Proof: Let G'{P U {t},QU {s}, E') be the graph obtained by adding edge (n, v) to Gr, that is, E' = 
Er^J {{u,v)]. Since u and v are free vertices, all vertex degrees in G' remain bounded by d. We now show 
how G' can be extended to a d-regular canonical graph. 

We first add to G' a perfect matching M' of size d connecting an arbitrary set of dfree vertices in Qi to an 
arbitrary set of dfree vertices in P2- This is feasible since G' has at least ^ free vertices each in both Qi and 
P2. In the resulting graph, since the total degree of all vertices in Pi is same as the total degree of all vertices 
in Qi, we can repeatedly pair together a vertex of degree less than d in Pi with a vertex of degree less than d in 
Qi until degree of each vertex becomes exactly d,fori G {1,2}. Let E" be the set of edges added to G' U M' 
in this manner, and let G" be the final graph. The graph G" satisfies all properties of a canonical graph in the 
family Q [d) except that it may not be a simple graph. We next transform G" into a simple d-regular graph by 
suitably modifying edges in E" . 

Given any graph H{Vh,Eh), we define 

^{H) = max{0, 7?(j;, y) - 1}, 

(x,y)£VH^VH 

where r]{x,y) denotes the number of times the edge (x,y) appears in H. Note that ^{H) = iff is a 
simple graph. Consider any edge {u, v) that has multiplicity more than one in G". It must be that (u, v) G E" 
since G' is a simple graph. Assume w.l.o.g. that u € Pi and v £ Qi. Let X C Pi and Y C Qi respectively 
denote the set of vertices adjacent to v and u in G". Using condition (d) on the graph Gr, we know that 

\E" n (Pi X Qi)\ > (^^ + 1^ _ (d + 1) > 2d^ 

Since \X\ < d and \Y\ < d, it follows that there must exist an edge (u', v') G E" n (Pi x Qi) such that 
u' ^ X and v' Y. We can thus replace edges {{u, v), {u' , v')} in E" with edges {(u, v'), {u' , v)} without 
violating the d-regularity condition. It is easy to verify that the exchange reduces $(G") by at least one, and 
that all edges involved in the exchange belong to the set E" . We can thus repeat this process until the graph 
G" becomes simple, and hence a member of the family Q{d). ■ 

Adversary strategy: For each vertexu G PU {t}, QU {s}, the adversary ^ maintains a list A^(u) of vertices 
adjacent to u that have been so far revealed to the algorithm V. Wlog we can assume that the algorithm V 
never queries a vertex u for which |A^(ti)| = d. At any step of the game, we denote by Gr the graph formed 
by the edges revealed thus far. We say the game is in evasive mode if the graph Gr satisfies the condition 
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(a) through (d) of Lemma[TOl and is in non-evasive mode otherwise. Note that the game always starts in the 
evasive mode, and then switches to non-evasive mode. 

When the game is in the evasive mode, in response to a query Q{u) by V for some free vertex u ^ Pi 
(i G {1, 2}), A returns an ai^bitrary free vertex v ^ Qi such that v N{u). The adversary then adds v to 
N{u) and u to N{v). Similarly, when V asks a query Q{u) for some free vertex u G Qi (i E {1, 2}), A 
returns an arbitrary free vertex v ^ Pi such that v N{u). It then adds v to N{u) and u to N{v) as above. 

As the game transitions from evasive to non-evasive mode. Lemma [lO] ensures existence of a canonical 
graph G Q{d) that contains the graph revealed by the adversary thus fai" as a subgraph. The adversaiy 
answers all subsequent queries by P in a manner that is consistent with the edges of G. The lemma below 
shows that the simple adversary strategy above forces 17 (d^) queries before the evasive mode terminates. 

Lemma 11 The algorithm makes Q{d?) queries before the game enters non-evasive mode. 

Proof: The adversary strategy ensures that conditions (a) thi^ough (c) in Lemma [lO] are maintained at all 
times as long as the game is in the evasive mode. So we consider the first time that condition (d) is violated. 
Since each query answered by the adversary in the evasive mode contributes 1 to the degree of exactly one 
vertex in Qi U P2, A always answers at least Q.{<f') queries before the number of vertices with degree less 
than I falls below ^■ in either Qi or ^2- The lemma follows. ■ 

Since A can not discover an edge in M' until the game enters the non-evasive mode, we obtain the desired 
lower bound of il(d^). 

5 An Q(n log n) High Probability Lower Bound 

In this section we prove the lower bound on the running time of a randomized algorithm for finding a matching 
in a regular bipartite multigraph stated in Theorem|3] We first reiterate that even though the algorithm obtained 
in section |2] is stated for simple graphs, the same runtime analysis applies for multigraphs as long as edge 
multiplicities are bounded above by d/2. The restriction on maximum edge multiplicity is necessary to ensure 
that SAMPLE-OUT-EDGE takes 0(1) time in expectation. In this section we show that every algorithm that 
finds a matching in a d-regular multigraph (even with edge multiplicities bounded above by d/2) probes at 
least (7/64) n In n locations in the input adjacency arrays with probability at least (on some fixed family 
of distributions). The lower bound instances use d = G(n). 

We first introduce the following problem, which we will refer to as BIPARTITE-DISCOVERY((i). 

Definition 12 (BIPARTITE-DISCOVERY {d)) Let G = {P,Q,E) be a bipartite multigraph with \P\ = Ad 
and \Q\ = d. The set of edges E{G) is constructed as follows. For each u £ Q choose d neighbors in 
P uniformly at random with replacement. A node u* £ Q is then marked as special, and edges incident 
to the special node are referred to as special. The graph G is presented in adjacency array format with 
edges appearing in random order in adjacency lists. When an algorithm A queries a neighbor of a vertex 
u £ Q or V £ P, an incident edge is returned uniformly at random among the yet undiscovered edges. The 
location of the edge in the adjacency arrays of both endpoints is revealed to A, i.e. it is no longer considered 
undiscovered when any of its endpoints is queried. The algorithm is not allowed to query the special node 
directly. 

Algorithm A solves BIPARTITE-DISCOVERY{d) if it finds an edge incident to the special node. The cost 
of A is defined as the number of queries that it makes before discovering an edge to the special node. 

We show the following: 

Lemma 13 Any algorithm that solves BIPARTITE-DISCOVERY{d) makes at least (7/2) din d queries with 
probability at least d~^ for any 7 > 0. 
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Proof: Suppose that the algorithm has discovered J edges of G. Then the probability of the next query not 
yielding a special edge is at least ^^^jzy^' independent of the actual set of edges of G that have already been 
discovered. Hence, the probability of not discovering a special edge after J < d? /3 queries is at least 

ii rf2_- -ii 2d73 - 

j=o j=o ' 

for sufficiently large d. Hence, we have that the probability of not finding a special edge after {'y/2)dlnd 
queries is at least d"'^. ■ 
We now give a reduction from BIPARTITE-DISCOVERY(c?) to the problem of finding a matching in a 
regular bipartite multigraph with edge multiplicities bounded by d/2: 
Proof of Theorem |3} 

Let A be an algorithm that finds a matching in a regular bipartite multigraph with edge multiplicities 
bounded above by d/2 and makes fewer than (7/64)n In n queries with probability at least 1 — n~'^ on every 
such graph. We will give an algorithm A' that solves BIPARTITE-DISCOVERY((i) and makes fewer than 
(7/2)6? In d queries with probability strictly larger than 1 — d^^. 

Consider an instance G = {P, Q, E) of BIPARTITE-DISCOVERY(d). Algorithm A' first checks if the 
degrees of all nodes in P are smaller than d/2. If there exists a node with degree strictly larger than d/2, A' 
queries all edges of all vertices in P and thus finds a special edge in at most queries. Note that since the 
expected degree of vertices in P is (i/4, the probability of this happening is at most e~'^ for sufficiently large 
d by an application of the Chernoff bound with a union bound over vertices of P. 

Now suppose that degrees of all nodes in P are at most d/2. A' adds a set of 3d vertices Q' to the Q 
side of the partition of G and connects nodes in Q' to nodes in P to ensure that the degree of every vertex 
in P and Q' is exactly d (it can be shown using an argument similar to the one in the proof of Lemma [TO] 
that this can be done without introducing double edges). Denote the resulting regular multigraph by G^ = 
(P, Q U Q' , E U E'). Note that G~^ has 4d vertices in each part, and one vertex in the Q part of the bipartition 
is marked special together with its d adjacent edges. Now A' constructs the final graph by putting together 
two copies of In particular, we denote by G^ a mirrored copy of G^, i.e. G^ = {Q U Q', P,EU E'), 
and finally denote by G* the graph obtained by taking the union of and removing the two special 
nodes and identifying special edges in G'^ with special edges in G^ . Note that any matching in G* contains 
a special edge, so algorithm A necessarily finds a special edge. Note that a query to an adjacency list in G'^ 
or G~ can be answered by doing at most one query on G. The number of vertices in each bipartition of G* is 
8d — 1 and the degree of each node is d. 

By assumption, algorithm A does not make more than (7Y64)nlnn queries with probability at least 
1 — n~"' for any 7' > 0. Setting n = 8d — 1 and 7' = 27, we get that A does not need more than 
(27/64)8(iln(8(i) < (7/2)(iln d queries with probabihty at least ly'^'^ > 1 - d~27 for sufficiendy 

large d. Hence, we conclude that A' probes at most (7/2)^ In d locations with probability at least 1 — d~'^'^ + 
e""' > 1 — d~"' , contradicting Lemma [T3l ■ 
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